Model-Based Reinforcement Learning via Stochastic Hybrid Models
نویسندگان
چکیده
Optimal control of general nonlinear systems is a central challenge in automation. Enabled by powerful function approximators, data-driven approaches to have recently successfully tackled challenging applications. However, such methods often obscure the structure dynamics and behind black-box over-parameterized representations, thus limiting our ability understand closed-loop behavior. This article adopts hybrid-system view modeling that lends an explicit hierarchical problem breaks down complex into simpler localized units. We consider sequence paradigm captures temporal data derive expectation-maximization (EM) algorithm automatically decomposes stochastic piecewise affine models with transition boundaries. Furthermore, we show these time-series naturally admit extension use extract local polynomial feedback controllers from experts via behavioral cloning. Finally, introduce novel hybrid relative entropy policy search (Hb-REPS) technique incorporates nature optimizes set time-invariant derived approximation global state-value function.
منابع مشابه
Transferring Models in Hybrid Reinforcement Learning Agents
The main objective of transfer learning is to reuse knowledge acquired in a previous learned task, in order to enhance the learning procedure in a new and more complex task. Transfer learning comprises a suitable solution for speeding up the learning procedure in Reinforcement Learning tasks. In this work, we propose a novel method for transferring models to a hybrid reinforcement learning agen...
متن کاملReinforcement Learning: Model-based
Reinforcement learning (RL) refers to a wide range of dierent learning algorithms for improving a behavioral policy on the basis of numerical reward signals that serve as feedback. In its basic form, reinforcement learning bears striking resemblance to ‘operant conditioning’ in psychology and animal learning: actions that are rewarded tend to occur more frequently; actions that are punished ar...
متن کاملModel-Based Reinforcement Learning
Reinforcement Learning (RL) refers to learning to behave optimally in a stochastic environment by taking actions and receiving rewards [1]. The environment is assumed Markovian in that there is a fixed probability of the next state given the current state and the agent’s action. The agent also receives an immediate reward based on the current state and the action. Models of the next-state distr...
متن کاملReinforcement Learning using Kernel-Based Stochastic Factorization
Kernel-based reinforcement-learning (KBRL) is a method for learning a decision policy from a set of sample transitions which stands out for its strong theoretical guarantees. However, the size of the approximator grows with the number of transitions, which makes the approach impractical for large problems. In this paper we introduce a novel algorithm to improve the scalability of KBRL. We resor...
متن کاملModel-Based Probabilistic Pursuit via Inverse Reinforcement Learning
In this paper we address the integrated prediction, planning, and control problem that enables a single follower robot (the “photographer”) to quickly re-establish visual contact with a moving target (the “subject”) that has escaped the follower’s field of view. Our work addresses this unavoidable scenario, which reactive controllers are typically ill-equipped to handle, by making intelligent p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE open journal of control systems
سال: 2023
ISSN: ['2694-085X']
DOI: https://doi.org/10.1109/ojcsys.2023.3277308